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ABSTRACT 

Nuclear factors NF90 and NF45 form a complex 
involved in a variety of cellular processes and are 
thought to affect gene expression both at the tran- 
scriptional and translational level. In addition, this 
complex affects the replication of several viruses 
through direct interactions with viral RNA. NF90 
and NF45 dimerize through their common 'DZF' 
domain (domain associated with zinc fingers). 
NF90 has additional double-stranded RNA-binding 
domains that likely mediate its association with 
target RNAs. We present the crystal structure of 
the NF90/NF45 dimerization complex at 1.9-A reso- 
lution. The DZF domain shows structural similarity 
to the template-free nucleotidyltransferase family of 
RNA modifying enzymes. However, both NF90 and 
NF45 have lost critical catalytic residues during evo- 
lution and are therefore not functional enzymes. 
Residues on NF90 that make up its interface with 
NF45 are conserved in two related proteins, 
spermatid perinuclear RNA-binding protein (SPNR) 
and zinc-finger RNA-binding protein (Zfr). Using a 
co-immunoprecipitation assay and site-specific 
mutants, we demonstrate that NF45 is also able to 
recognize SPNR and Zfr through the same binding 
interface, revealing that NF45 is able to form a 
variety of cellular complexes with other DZF- 
domain proteins. 

INTRODUCTION 

Nuclear factors NF90 (also known as ILF3, DRBP76 or 
NFAR-1) and NF45 (also known as ILF2) form a protein 
complex that regulates gene expression at multiple levels 
(1,2). The complex was first identified as a transcrip- 
tion factor involved in IL-2 transcription in activated 



T-cells (3,4). Subsequently the NF90/NF45 complex has 
been identified in a number of different contexts. For 
example, NF90/NF45 has been implicated in DNA 
binding, transcription and repair, although it does not 
contain classical DNA-binding domains (3,5-8). On the 
other hand, NF90 contains classical double-stranded 
RNA-binding domains (dsRBDs) and is proposed to 
play a role in gene expression at the RNA level, either 
through direct association with mRNAs or through re- 
pression of miRNA metabolism (9-15). 

In addition to its normal cellular roles, NF90/NF45 
plays a critical role in the life cycle of several viruses, 
such as Hepatitis C virus (HCV), Bovine Viral 
Diarrhoea Virus (BVDV), Dengue virus and human im- 
munodeficiency virus (HIV) that have a global impact on 
human and animal health (13,16-19). In the case of 
plus-stranded RNA viruses such as HCV and Dengue 
virus, NF90/NF45 is thought to associate with the viral 
genomic RNA through regulatory RNA structures in the 
5'- and 3'-non-translated regions and has been observed to 
co-localize with viral replication foci in the cytoplasm 
(18,19). Plus-stranded RNA viral genomes resemble 
cellular mRNAs and it is possible that a normal cellular 
role of NF90/NF45 is co-opted by the viruses to further 
their own ends. Indeed, in the case of HIV, NF90 impacts 
on viral replication by modulating translation of Cyclin 
Tl through association with the 3'-untranslated region of 
Cyclin Tl mRNA (13). Knock-down of NF90 causes a 
shift of Cyclin Tl mRNA from polysomes to monosomes 
and the consequent reduction in Cyclin Tl protein reduces 
HIV proliferation. 

NF90/NF45 has been shown to bind to several specific 
mRNAs other than Cyclin Tl, including MKP-1, IL-2, 
acid P-glucosidase and to affect either translation or 
turnover of these mRNAs (9,11,15). Global RNA 
immunoprecipitation studies on the other hand, have 
shown that the complex can associate with a wide 
variety of mRNAs and non-coding RNAs (12,14,20). 
Together, these studies indicate a role for NF90/NF45 in 
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post-transcriptional control of gene expression. However, 
what motifs are recognized by NF90/NF45 and what the 
molecular consequences are of NF90/NF45 association 
with mRNAs are not yet well understood. Clearly, a 
better understanding of the molecular role of this 
complex would provide valuable insights into its 
function both in viral infection and in normal cellular 
processes. 

NF90 is found only in vertebrates while NF45 is found 
throughout the metazoa. In mammals, NF90 and NF45 
show a broad tissue distribution but are particularly 
abundant in brain, spleen, testes and liver (14,21,22). 
Although NF90 and NF45 are nominally nuclear 
factors, the protein complex has been shown to shuttle 
between the nucleus and cytoplasm and certain splice 
variants of NF90 are preferentially located in the cyto- 
plasm in normal tissue (14,15). 

NF90 has a domain architecture that consists of an 
N-terminal domain with no known function, called a 
'domain associated with zinc fingers' or DZF domain. 
This domain is followed by two dsRBDs and a 
low-complexity region at the C-terminus. The C-terminal 
region is the site of most sequence variations generated by 
alternative splicing and longer splice variants of NF90 
show an enrichment of arginine, glycine and serine 
residues in this region (23). The DZF domain is found 
in three other proteins, namely NF45, spermatid peri- 
nuclear RNA-binding protein (SPNR, also known as 
STRBP) and Zinc-finger protein associated with RNA 
(Zfr). SPNR is a testes-specific paralogue of NF90 that 
has a similar domain organization while Zfr, an essential 
gene in mice and flies, has a different domain organiza- 
tion. In Zfr the DZF domain is found at the C-terminus 
and is preceded by several RNA-binding zinc fingers. 
NF45, the partner of NF90, is much smaller than the 
other proteins in the family. The DZF domain of NF45 
is the only folded domain present in the protein sequence 
and is flanked by an arginine- and glycine-rich sequence at 
the N-terminus and a glutamate-rich sequence at the 
C-terminus (Figure 1A). 

While NF90/NF45 is thought to have a role in many 
aspects of gene expression and RNA metabolism, the mo- 
lecular mechanisms responsible for these various processes 
are poorly understood. One possibility is that the dsRBDs 
of NF90 might direct the complex to its RNA targets, 
although the specificity of such an interaction remains un- 
defined. Furthermore, the DZF domain is thought to 
mediate dimerization of NF90 with NF45 but any 
further role that this domain might have is not known. 
To gain insight into the possible function of NF90/ 
NF45, we have solved the crystal structure of a truncated 
complex that contains the DZF dimerization domains. 
Our work demonstrates that the DZF domain has 
evolved from a family of RNA metabolizing enzymes. 
Based on the conservation of the NF90 dimer interface, 
we were able to identify other putative binding partners of 
NF45. We demonstrate, using a cell-based assay that 
NF45 can dimerize with other DZF-domain containing 
proteins through a structurally conserved interface. This 
finding suggests that NF45 may have a previously 



unappreciated role in other complexes that are involved 
in RNA metabolism. 

MATERIALS AND METHODS 

Purification of NF90/NF45 constructs 

Mouse cDNAs for NF90 and NF45 were obtained from 
Imagene. Expression constructs of NF90 (residues 1-590 
and residues 380-590) were cloned as N-terminal 
hexahistidine- tagged fusion proteins into pETMCN (24). 
Further C-terminal deletion constructs were made by 
site-directed mutagenesis to add stop codons to the 
NF90AC591 construct. NF45 (residues 29-390) was 
cloned into pGEX-6-p-l, with a N-terminal GST fusion 
(GE Healthcare). For crystallization, NF90AC381 and 
NF45AN28 were expressed separately in Escherichia coli 
BL-21 Gold pLysS cells and co-lysed in 20 mM Tris-HCl 
pH 7.5, 500 mM NaCl 1 mM DTT. The complex was 
purified in batch from cleared lysates using GSH resin 
and eluted by overnight digestion with tobacco etch 
virus (TEV) and 3C proteases. The protein was dialysed 
into a low salt buffer (20 mM Tris-HCl pH 7.5, 50 mM 
NaCl, 1 mM DTT). Further purification was carried out 
using heparin sepharose with a salt gradient to lOOOmM 
NaCl, and size exclusion chromatography (20 mM Tris- 
HCl pH 7.5, 150mM NaCl, 1 mM DTT). 

Crystallization 

The NF90AC381/NF45AN28 complex was concentrated 
to 8 mg/ml and crystallized by vapour diffusion in a sitting 
drop with a reservoir solution of 100 mM MES pH 6.0, 
0.22 M sodium malonate and 18% PEG 8000. Initial 
crystals diffracted poorly and showed multiple lattices. 
Crystals were improved using serial streak seeding. 
Highly diffracting crystals grew in space group 7222. 
However many crystals were also found that grew in 
related C2 space groups where one of the symmetry axes 
of 7222 had degenerated. 

Data collection and structure solution 

The crystals were cryoprotected in crystallization buffer 
supplemented with 10% glycerol and flash cooled to 
100 K in a liquid nitrogen cryostream. For phasing experi- 
ments, the crystals were soaked for 1-2 h with K 2 AuCN 4 
prior to cryoprotection and cooling. Data were collected 
at the X06SA beamline at the Swiss Light Source (Villigen, 
Switzerland) using the PILATUS detector (25). The data 
were indexed and integrated using XDS (26). The heavy 
atom substructure was determined using SHELX (27) and 
phases were improved using SHARP (28). Approximately 
70% of the structure was completed by autobuilding with 
BUCCANEER (29,30). Further building was carried out 
manually using the graphics software COOT (31) and the 
structure was further refined against a higher resolution 
native dataset with PHENIX (32). The final statistics for 
the geometry of the structure were calculated using 
MOLPROBITY (33). Figures were prepared using 
Pymol (The PyMOL Molecular Graphics System, 
Version 1.3, Schrodinger, LLC). 
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Figure 1. Overview of the NF90/NF45 DZF dimerization domain structure. (A) Schematic representation of the domain organization of NF45 
(green) and NF90 (blue). Double-headed arrows underneath indicate the sub-fragments of each protein that were used for crystallization. 
(B) Overview of the structure showing a 'side' view and a 'top' view rotated 90° around the horizontal axis. (C) Superposition of NF90 (blue) 
on NF45 (gray) using the same orientations of NF45 as shown in B. Extensions at the N- and C-termini of NF45 that are involved in dimerization 
interactions are shown in yellow. A C-terminal extension of NF90 is shown in magenta with a dotted line indicating a missing loop. 
(D) Superposition of NF45 (grey) on yeast Poly (A) polymerase (PDBid 2q66, brown) and CCA-adding enzyme from Archaeoglobus fulgidis 
(PDBid ltfw, pink), shown from the 'top' view from (B). 



Nucleotide triphosphate soaking/co-crystallization and 
data analysis 

To overcome problems of dual occupancy of nucleotides 
and malonate in the putative actives site of NF90AN381/ 
NF45AN28, the protein was rescreened for alternative 
crystallization conditions. Crystals were grown in 
12-14% PEG3350, 200 mM MgCl 2 , 100 mM MES pH 
6.5 and 5% glycerol and were either soaked in a 
cryoprotecant containing lOmM of the nucleotide tri- 
phosphate (NTP) or co-crystallized in the presence of 
lOmM NTP. The crystals were cryoprotected in the crys- 
tallization buffer supplemented with 20% glycerol and 



flash-cooled in liquid nitrogen to 100 K. Data were col- 
lected at beamlines 102 and 104-1 at Diamond Light 
Source (Didcot, UK). As with native crystals, data from 
NTP-bound crystals were indexed and integrated using 
XDS (26). Crystals grown in PEG3350 grew either in 
space group 7222 with one molecule per asymmetric unit 
or P2i with two molecules per asymmetric unit. Molecular 
replacement using the native structure was carried out 
with Phaser (30,34) followed by refinement in PHEN1X 
(32) and model building with COOT (31). In the absence 
of ligand, clear positive difference density was observed in 
the F a —F c maps for the nucleotides. However during re- 
finement after fitting the ligands, negative F 0 -F c peaks 
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appeared suggesting that the ligands were not present at 
full occupancy. The occupancy of the ligands was adjusted 
manually to model the best fit with the electron density 
map and not refined. 

Co-immunoprecipitation 

Sequences for human Zfr and SPNR were obtained as 
cDNAs from Imagene. Vectors containing human 
NF90b and NF45 cDNAs were a kind gift from S-E. 
Behrens (Martin Luther University, Halle). Zfr, SPNR, 
NF90b and NF45 were cloned into pEF-based mamma- 
lian expression vectors (a kind gift from Elena Conti, MPI 
Biochemistry, Martinsried) containing either an 
N-terminal HA or an N-terminal HA-FLAG epitope 
tag. Site-directed mutants were generated by PCR 
amplification of the vector by Pfu Ultra polymerase 
(Stratagene) with overlapping oligonucleotide primers 
that encoded the mutation. Vector DNA was transiently 
transfected into 293T cells in six-well plates (Iwaki) 
growing in DMEM with foetal bovine serum and penicil- 
lin/streptomycin (Gibco). An amount of 4ug of vector 
DNA was transfected using 6ul lipofectamine 2000 
(Invitrogen) in a total volume of 200 ul OptiMEM 
(Gibco). The medium was replaced after 12 h and cells 
were harvested after 36 h. The cells were placed on ice 
and washed with 2 ml PBS and then lysed by incubation 
with 250 ul NET-G buffer (50 mM Tris-HCl pH 7.5, 
150mM NaCl, 0.1% NP40, 1 mM EDTA, 10% glycerol, 
protease inhibitor cocktail (Roche), 3U RNase A/Tl mix 
(Fermentas) and 0.6 U RQ1 DNase (Promega)) for 15min 
at 4°C. The lysed cells were then resuspended from the 
plates and centrifuged at 16000g for 15min at 4°C to 
remove insoluble debris. Ten percent of the soluble 
fraction was removed for sodium dodecyl sulphate-poly- 
acrylamide gel electrophoresis (SDS-PAGE) analysis 
(input) while the remainder of the soluble fraction was 
incubated with 15ul a-FLAG resin with rotation at 4°C 
for 1 h. The samples were then washed three times in 
NET-G buffer and excess liquid was removed with a 
gel-loading tip. Proteins were eluted from the a-FLAG 
resin by incubation with 3x FLAG peptide (0.2mg/ml 
in NET-G) at 4°C for 15min. Samples from input and 
eluted proteins were separated by SDS-PAGE, 
transferred onto a nitrocellulose membrane (Whatman) 
and probed with a mouse a-HA antibody (Covance) at a 
1:2500 dilution in 5% milk in PBS-T. After washing in 
PBS-T, the blot was then probed with a secondary goat 
a-mouse antibody (Biorad) at a dilution of 1:10000 in 5% 
milk/PBS-T. After washing off the secondary antibody, 
the blot was visualized with Super West pico ECL 
reagent (Pierce) and exposed to film. 

RESULTS 

Structural studies 

Structural studies were carried out with mouse NF90 and 
NF45. Mouse and human NF90 show 95% sequence 
identity while mouse and human NF45 are identical at 
the protein level. Both NF90 and NF45 can be expressed 
recombinantly in E. coli as soluble proteins. However, 



attempts to purify the proteins separately were problem- 
atic as NF45 showed a tendency to aggregate and precipi- 
tate. Thermal denaturation studies of NF45 showed that it 
has a melting temperature (T m ) of 25°C while a complex of 
the DZF domains of NF90 and NF45 showed a T m of 
55°C indicating that the presence of NF90 greatly 
enhances the stability of NF45 (Supplementary Figure 

51) . A co-lysis and co-purification approach allowed a 
1:1 complex of NF90/NF45 to be isolated that showed 
no aggregation. Indeed, the complex is stable even in 
high salt concentrations of 0.5 M. 

NF90 contains two dsRBDs that have previously been 
shown to bind dsRNA (35,36). To assess the integrity of 
the purified proteins, electrophoretic mobility shift assays 
(EMSA) were carried out with both single-stranded (ss) 
and double-stranded (ds) RNA substrates. A long con- 
struct NF90AC591/NF45 (which includes the DZF and 
dsRBD domains of NF90 but not the C-terminal low 
complexity region) and a fragment containing only the 
dsRBDs of NF90 (residues 380-590) both showed 
dsRNA-binding activity as expected. A shorter construct 
containing only the DZF dimerization domains 
(NF90AC381/NF45AN28) showed no mobility shift 
with either ss or dsRNA indicating that this fragment is 
not necessary for RNA binding (Supplementary Figure 

52) . However, the long NF90AC591/NF45 complex 
shows qualitative differences compared with the dsRBD 
construct: the long complex shows apparently tighter 
binding and less heterogeneity than the dsRBDs alone. 
This suggests that the DZF domain, while not required 
for RNA binding, might nevertheless contribute to recog- 
nition of RNA substrates. 

Initial attempts to crystallize the NF90AC591/NF45 
construct were not successful, most likely because of the 
multiple domains present in NF90. A series of C-terminal 
truncations of NF90 were designed based on sequence 
analysis and limited proteolysis. Expression trials 
showed that NF90 constructs truncated beyond residue 
380 were not soluble. Well-diffracting crystals were 
obtained with complexes that contained residues 1-380 
of NF90 and 29-390 of NF45. An overview of the 
domain structure of NF90 and NF45 is shown in 
Figure 1A, with the constructs used for crystallization 
indicated underneath. 

Phases for the structure were obtained using single 
wavelength anomalous dispersion (SAD) from gold- 
derivatized crystals that diffracted to 3.3 A. The structure 
was refined against a native dataset that diffracted to 1.9 A 
with a i? wor k = 18.5% and i? free = 21.0% and good stereo- 
chemistry (Table 1). The model includes 658 residues out 
of a total of 741 that were present in the crystal. Residues 
in the N- and C-termini and several loops (residues 1-5, 
55-86, 341-353 and 375-380 of NF90 and 362-390 of 
NF45) could not be modelled due to poor electron 
density in these regions that made a clear interpretation 
of the map difficult. 

The DZF dimerization domains of NF90 and NF45 
form an oblong structure with a flat face on one side 
and a curved face on the other (Figure IB). Each 
protein shows a bipartite domain, characterized by an 
N-terminal mixed a-(3 region that contains a central 
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Table 1. Data collection, phasing and refinement statistics for NF90/ 
NF45 native structure 
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anti-parallel (3-sheet and a C-terminal a-helical region 
(Figures IB and 2). NF90 and NF45 have 30% sequence 
identity over the DZF domain and superpose with a root 
mean square deviation (r.m.s.d.) of 2.0 A over 261 residues 
out of 334 (Figure 1C). This gives the overall structure a 
pseudo 2-fold rotational symmetry. The central (3-sheet of 
each domain forms the base of a cleft between the N- and 
C-terminal halves while dimerization of NF90 and NF45 
is mediated by the ot-helices at the C-terminus of each 
protein. NF45 has an extension to the DZF domain at 
the N-terminus, consisting of residues 29^17, that contrib- 
utes to dimerization (see below). NF90 and NF45 also 
differ at their C-termini: residues 354-374 of NF90 pack 
in an extended conformation against the flat side of NF90. 
In contrast, residues 343-361 of NF45 follow a different 
path, forming a short P-hairpin and then binding in an 
extended conformation along the dimerization interface 
(Figure 1C). 

A structural similarity search using the DALI server 
showed that the DZF domains of NF90 and NF45 have 
significant structural similarity to members of the 
template-free nucleotidyltransferase family that includes 
poly (A) polymerases, TUTases, 2'-5' oligoadenylate 
synthase and the CCA-adding enzymes of tRNAs 
(Figure ID and Supplementary Figure S3). Template- 
free nucleotidyltransferases belong to the DNA polymer- 
ase (3 family of nucleotidyltransferases (37,38). None of 
the template-free nucleotidyltransferases for which struc- 
tures are known encode dsRBDs for substrate recognition. 
However both poly (A) polymerase and the CCA-adding 
enzyme have RNA-binding domains in addition to their 



catalytic domains that aid with substrate recognition or 
processivity. The dsRBD domains of NF90 emanate from 
the flat side of the molecule, on the opposite face to that 
observed in other template-free nucleotidyltransferases 
(Supplementary Figure S3). However, the linker region 
between the end of the visible C-terminus and the first 
dsRBD is ~23 residues, which would be sufficient to 
allow the dsRBDs to access the same face of the 
molecule as the auxiliary RNA-binding domains 
observed in other template-free nucleotidyltransferases. 

This family of enzymes catalyses the transfer of one 
NTP to the end of an existing oligonucleotide substrate. 
These enzymes have a catalytic signature sequence that 
includes three negatively charged residues (in yeast poly 
(A) polymerase, Papl, these are residues Asp 100, Asp 102 
and Asp 154) at the base of the catalytic cleft, on the 
exposed surface of the P-sheet (Figure 3A). These nega- 
tively charged residues are required to position two 
divalent metal ions (typically two Mg 2+ ions or Mg 2+ 
and Mn 2+ ) that co-ordinate the close approach of the 
phosphate groups of an incoming NTP to the substrate 
RNA primer (Figure 3A). In addition to this catalytic 
triad, a serine residue (Ser89 in Papl) and a lysine 
residue (Lys215) are conserved in the template free 
nucleotidyltransferases and help to orient the y-phosphate 
of the NTP. 

Neither NF90 nor NF45 has a full complement of these 
critical residues in their putative active sites. In NF45, two 
of the characteristic negatively charged residues, equiva- 
lent to Asp 100 and Asp 154, have been replaced with a 
valine residue (Vail 19) and a lysine residue (Lysl74), re- 
spectively (Figure 3B). In NF90, the residue equivalent 
to Aspl54 is replaced by a threonine residue (Thrl68, 
Figure 3C). In addition, the serine and lysine residues at 
the base of the nucleotide-binding site are replaced by a 
leucine (Leu97) and an arginine residue (Arg234), respect- 
ively (Figure 3C). The structural analysis strongly indi- 
cates that the DZF domains of NF90 and NF45 are 
unlikely to possess nucleotidyltransferase activity. We 
further carried out an in vitro nucleotidyltransferase 
assay using a-labelled ATP or UTP. Although activity 
could be shown for a known poly (A) polymerase, no 
activity was detectable for NF90/NF45 in the conditions 
we used for this assay (Supplementary Figure S4) 

Nucleotide binding to NF90/NF45 

Although it is unlikely that NF90/NF45 could actually 
catalyse nucleotidyl transfer, prominent cleft regions of 
both NF90 and NF45 are conserved in both proteins 
(Supplementary Figure S5) suggesting that this region of 
the protein might nevertheless be functionally important. 
Furthermore, a malonate ion from the crystallization 
solution, which is negatively charged, was found in the 
cleft of both NF90 and NF45 in approximately the 
position where the triphosphate moiety of nucleotides is 
bound in other template-free nucleotidyltransferases. This 
suggested that NF90/NF45 might bind nucleotide ligands. 

To identify potential nucleotide ligands, NF90AC381/ 
NF45AN28 crystals were soaked with ATP, UTP, CTP 
and GTP. Initial data from soaking experiments showed 
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Figure 2. Conservation of NF90 and NF45 sequences. (A) Multiple sequence alignment of NF45 with sequences from mouse (M.m) zebrafish (D.r.) 
Drosphila (D.m.) and Ciona intestinalis (C.i.)- A structure-based alignment of NF45 with yeast poly (A) polymerase is shown underneath. Human 
NF45 was not included as it has 100% identity with mouse NF45. Secondary structure elements are shown Linderneath as bars (a-helices) or arrows 
((3-strands). Blue dots indicate residues that interact with NF90, pink dots indicate residues that interact with ATP and black boxes show residues 
equivalent to those involved in catalysis in functional nucleotidyl transferases. Purple boxes indicate the two residues on helix 9 that were mutated to 
disrupt dimer formation. (B) Sequence alignment of NF90 with sequences from mouse (M.m.), human (H.s.), zebrafish (D.r.) and Xenopus (X.I.). 
NF90 is only found in vertebrates. Green dots indicate residues that are involved in dimerization interaction with NF45. 



extra density in the cleft of NF45 that was overlapping but 
larger than the electron density of the malonate ion. This 
suggested that nucleotides were competing with malonate 
ions for binding. An alternative crystallization form was 
identified that grew with PEG3350 as a precipitant and did 
not require sodium malonate. Crystals grown in PEG3350 
belong either to 7222 or to P 2\ space groups and diffract 
to lower resolution than crystals grown in sodium 
malonate. Several datasets of these crystals that were 



either grown or soaked with different nucleotides were 
collected and analysed (Table 2). Datasets from crystals 
co-crystallized with ATP or soaked with UTP (at 2.7 and 
2.8 A resolution, respectively) showed clear electron 
density for the ligand in the cleft of NF45. A dataset 
where CTP was co-crystallized with the complex diffracted 
to 3.1 A resolution and also showed positive difference 
density at the same site but the density for the nucleotide 
was less well defined. This could be because CTP binds less 
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Figure 3. A close-up view of the active site of poly (A) polymerase and equivalent regions in NF90 and NF45. (A) The catalytic cleft of yeast poly 
(A) polymerase (derived from PDBids 2q66 and 3c66) showing the catalytic residues and the bound ATP and a poly (A) substrate. (B) and (C) top 
view of NF45 and NF90 oriented as in Figure 1C, looking into the cleft between the two domains. The residues shown are those in equivalent 
positions to catalytic and nucleotide-binding residues found in related nucleotidyl transferases. (D) ATP bound to the cleft of NF45. A 2F Q -F C map 
from the final refined structure is shown, contoured at la. (E) and (F) A similar view as in (D) showing complexes with UTP and CTP, respectively. 



well to this site and so does not saturate the nucleotide- 
binding sites in the crystal. Although a number of datasets 
were collected from crystals grown in the presence of 
GTP, none showed any evidence for nucleotide binding. 
Thermal denaturation assays with NF45 and the various 
nucleotides showed a similar trend: greater stabilization of 
the protein was seen in the presence of ATP and UTP 
(6-7° C) while CTP and GTP showed a lesser degree of 



stabilization (5°C and 3°C, respectively, Supplementary 
Figure S6A). 

In all of the datasets analysed, the electron density 
was clearly interpretable for the triphosphate moiety of 
the NTPs and a magnesium ion but less clear for 
the bases. A similar effect has been noted in previous 
studies with mitochondrial editosome-like complex 
associated TUTasel (MEAT1) and RNA editing 
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Table 2. Data collection and refinement statistics for NF90AC381/NF45AN28 crystals with bound NTPs 
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terminal uridylyltransferase 2 (RET2) where clear density 
for the triphosphate moiety could be seen with non- 
cognate NTPs but the bases remained poorly defined 
(39,40). The y phosphate of each of the NTPs makes a 
close approach to Serl07 at the base of the cleft and is 
further stabilized by Lys228 (Figure 3D-F). The Mg 2+ ion 
is co-ordinated by an oxygen atom of Aspl21 and three 
oxygen atoms from the a, P and y phosphates of the nu- 
cleotide. The metal is also likely to be co-ordinated by 
water molecules that were not visible at this resolution. 
None of the datasets analysed showed any evidence for 
nucleotide binding to the NF90 cleft. This is unlikely to be 
because of inaccessibility of the cleft. This region of NF90 
shows a similar exposure to solvent channels as the 
equivalent region of NF45. The structural analysis 
points to a crucial difference in the pocket between 
NF90 and NF45, namely the presence of a leucine 
residue, Leu97, at the base of the binding pocket. This 
residue is found in the equivalent position to the conserved 
serine, Serl07, which is present in NF45 and in functional 
template-free nucleotidyltransferases. The proximity of 
Leu97 to Arg234, which is normally a lysine residue in 
other nucleotidyltransferases, may form a steric block 
that prevents the triphosphate moiety from binding. 

Surprisingly, when the binding mode of ATP, UTP and 
CTP are compared in the three structures, the orientation 
of the triphosphate moiety is retained while the positions 
of the bases vary. The orientation of ATP in NF45 shows 
remarkable similarity to ATP binding in Papl. The 
adenine base contacts the edges of His207 on helix 4, 
which in turn is oriented by a pi-stacking interaction 
with Trp243. The adenine moiety makes further van der 
Waals interactions with the side chains of Thr241 and 
Pro242 at the beginning of helix 6. The base of UTP 



flips over to make an alternative set of van der Waals 
contacts with Leu 176 on strand 5 and Ser200 and 
Ala201 on helix 4. The cytosine base is placed in an inter- 
mediate position and has van der Waals interactions with 
His207, Ala203 and Ala204 of helix 4. 

The binding of ATP and UTP with NF45 is something 
of a puzzle. The binding site for these ligands is clearly 
conserved and the position and orientation of the triphos- 
phate moiety is similar to that observed in catalytically 
active template-free nucleotidyltransferases. However, 
the protein does not seem able to use the nucleotides to 
catalyse nucleotidyl transfer and also shows little struc- 
tural perturbation on nucleotide binding (r.m.s.d. values 
between native and nucleotide-bound structures range 
from 0.4 to 0.8 A). One possibility is that nucleotide 
binding is an evolutionary relic. This would be analogous 
to many pseudokinases of the serine/threonine protein 
kinase family. In several cases, structural studies on 
non-functional protein kinases have revealed that some 
of these proteins retain the ability to bind ATP (41,42). 
A more intriguing possibility is that nucleotide binding 
might represent an as yet undefined ligand such as a 
peptide partner, a small molecule or an RNA-associated 
chemical moiety. 

The NF90/NF45 dimerization interface 

The dimerization of NF90 and NF45 is mediated through 
a close anti-parallel association of helix 9 and the inter- 
action of the preceding loop region and C-terminal end of 
helix 8 in each DZF domain. This interaction buries a 
total area of 1698 A 2 , which is sufficient for a biologically 
relevant interaction (43). The helix 9 interface shows a 
remarkable symmetry at its core with three conserved 
residues on NF90 (Glu312, Gln319, Arg323) interacting 
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Figure 4. The NF90/NF45 dimerization interface. (A) Close up view of the dimerization interface in the same orientation as the top view in 
Figure IB. The C-terminal extension of NF45 (yellow) lies in an extended conformation over helix 9 (H9) of NF90 (blue) and NF45 (green). 
Helix 9 shows symmetrical binding interactions at the core. Mutated residues in NF45 are marked with an asterisk. (B) A close up view of the 
interface rotated 180° around the horizontal axis. The N-terminal extension of NF45 is shown in yellow. Conserved residues involved in the 
dimerization interface and in stabilizing the N-terminal extension are shown. (C) A sequence alignment of the dimerization interface of mouse 
NF90 with related DZF domain-containing proteins, SPNR and Zfr. Sequences from mouse (M.m.), zebrafish (D.r.) Xenopus species (X.l. and X.t.) 
and salmon (S.s.) are shown. 



with three similar conserved residues on NF45 (Asp308, 
Gln315, Arg319) (Figure 4A). The glutamine residues 
make a polar contact at the centre of the interaction 
while the Arg323 from NF90 forms a salt bridge with 
Asp308 of NF45, as does Arg319 of NF45 with Glu312 
on NF90. From this interaction one can imagine how this 
heterodimerization interface could have evolved from a 
homodimeric precursor. The symmetry of the interaction 
breaks down at either end of helix 9, with Phe327 at the 
C-terminal end of helix 9 in NF90 contributing a hydro- 
phobic interaction while His323 at the C-terminal end of 
NF45 helix 9 and Arg308 and Gln309 at the N-terminal 
end of NF90 helix 9 contribute to a more hydrophilic 
interface (Figure 4A). Packed against the helix 9-helix 9 
interface is an extended region of the C-terminus of NF45, 



which makes primarily main chain interactions with 
residues on both NF45 and NF90 and contributes a 
tyrosine side chain (Tyr358) to the hydrophobic pocket 
at the N-terminal end of NF45 helix 9. 

NF45 has an N-terminal extension in addition to the 
C-terminal extension that also contributes to the dimer- 
ization interface (Figures 1C and 4B). The N-terminal ex- 
tension of NF45 packs against the opposite face of the 
molecule, on helix 8 and the loop region that connects it 
to helix 9 (Figure 4B). Like the C-terminal extension, the 
N-terminal extension of NF45 interacts with both NF90 
and NF45 at the dimerization interface. However, unlike 
the C-terminal extension, these interactions are primarily 
mediated by side chain interactions that are contributed 
by His29, Pro31, Phe32, Asp33, Phe34, Tyr35 and Phe41. 
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Consequently, the N-terminal extension of NF45 buries a 
large hydrophobic surface area that presumably contrib- 
utes to stabilization of the complex. Additional polar 
interactions are contributed by Asp33, which forms a 
salt bridge with Arg270 on NF45, and His29, which 
hydrogen bonds with Glu270 and Glu277 on NF90. 

The contribution of the N- and C-terminal extensions of 
NF45 to its interaction with NF90 lends asymmetry to the 
interaction interface. It is unlikely that NF45 would be 
able to homodimerize because the extension sequences 
would likely cause a steric clash. Similarly, it also seems 
unlikely that NF90 could homodimerize as the hydropho- 
bic surface that is protected by the N-terminus of NF45 
would be exposed. Interestingly, when NF90 is compared 
with other DZF domain containing proteins such as 
SPNR and Zfr, the residues involved in NF45 interactions 
are relatively well conserved (Figure 3B). Based on these 
observations, two predictions can be made: first; that 
NF90 and NF45 are unable to form homodimers and 
second; that NF45 might be a binding partner for other 
DZF domain-containing proteins, namely SPNR and Zfr. 
Indeed, a TAP-tagging study of the binding partners of 
the Drosophila homologue of Zfr, Zn72D, showed that it 
co-purifies with Drosophila NF45 (44). 

To test whether NF90 and NF45 can homodimerize and 
whether NF45 is a binding partner of other DZF-domain 
containing proteins, a co-immunoprecipitation assay 
using tagged human proteins in mammalian cells was set 
up. For this assay isoform 6 of NF90, NF90b, was used. 
Each protein was cloned as an N-terminally HA-FLAG or 
HA tagged protein and pairs of differentially tagged con- 
structs were co-transfected into 293 T cells. Tagged 
proteins were immunoprecipitated with anti-FLAG resin 
and the presence of binding partners was examined by 
anti-HA western blot (Figure 5). All experiments were 
carried out in the presence of nucleases to remove RNA. 
Control experiments showed that HA-tagged proteins do 
not bind non-specifically to the anti-FLAG resin 
(Figure 5B, lanes 1-3). Co-transfection of HA-NF45 
with HA-FLAG NF45 did not show a complex form on 
co-immunoprecipitation (Figure 5B, lane 4), confirming 
previous observations with in vitro pull down experiments 
(45). Similarly HA-FLAG NF90 is unable to 
co-precipitate HA-NF90. In this case, a C-terminally 
truncated HA-FLAG-tagged mouse NF90AC591 was 
co-transfected with full-length NF90b to differentiate the 
two constructs by size. This confirms that neither NF45 
nor NF90 are able to homodimerize. 

NF45 forms alternative complexes with SPNR and Zfr 

NF90 and NF45 show a robust co-precipitation in this 
assay (Figure 5C, lane 1). To disrupt dimerization, a 
double mutant of NF45 was created where the charges 
were reversed on two of the residues forming the core 
interaction on helix 9 i.e. Asp308 was mutated to 
arginine (D308R) and Arg319 was mutated to glutamate 
(R319E). Although the mutated protein is expressed in the 
transfected cells, there is no association with NF90 in the 
co-precipitation assay, showing that the mutant does 
indeed disrupt binding to NF90 (Figure 5C, lane 2). 



Similar co-immunoprecipitation reactions showed a 
robust interaction between SPNR and NF45 (Figure 5C, 
lanes 3 and 4) and between Zfr and NF45 (lane 6). 
However, when either SPNR or Zfr were co-transfected 
with the mutant NF45, no interaction could be seen (lanes 
5 and 7). This demonstrates that NF45 can form 
complexes with both Zfr and SPNR. The lack of inter- 
action with the mutant shows that NF45 uses the same 
interface to interact with Zfr and SPNR as it uses to 
interact with NF90. The absence of an interaction with 
the mutant is unlikely to be caused by a disruption to 
the structural integrity of the mutant. Thermal denatur- 
ation assays carried out with a purified NF45 mutant 
showed that it has a similar melting profile to the wild- 
type protein and is also stabilized in the presence of UTP 
and ATP (Supplementary Figure S6B). 

Interestingly, in each case where NF90, SPNR and Zfr 
are co-expressed with the mutant NF45, the overall 
amount of protein expressed is lower than when functional 
pairs are expressed together. This effect is seen consist- 
ently over multiple experiments and so is unlikely to be 
caused by variable expression levels that are normally seen 
with transient transfections. It has previously been shown 
that NF90 protein levels are correlated with NF45 levels 
(46). Given the difference in stability of NF45 on its own 
versus in a complex (T m = 25°C versus 55°C) this differ- 
ence in expression is likely to come from the mutual sta- 
bilization effect of the two partner proteins. 



DISCUSSION 

The structural studies on the dimerization domain of 
NF90 and NF45 show that these proteins belong to the 
family of template-free nucleotidyltransferase enzymes. 
However, the lack of conserved catalytic residues 
suggests that the DZF domain encodes a 
'pseudotransferase' that is no longer able to catalyse 
transfer of nucleotides. Ligand-binding studies indicate 
that NF45, but not NF90, retains the ability to bind 
NTPs in a conserved binding pocket at the degenerated 
active site. Whether NTPs are the true ligands of this site 
remains to be established. 

The co-precipitation studies confirm two predictions 
based on the structure of the NF90/NF45 complex. We 
have demonstrated that NF45 is not able to homodimerize 
in a cellular context. This is most likely because of the 
intrinsic asymmetry of the dimerization interaction with 
NF90, as well as the potential steric clashes with the N- 
and C-terminal extensions of NF45. The second predic- 
tion was based on the observation that residues in NF90 
that are involved in binding are conserved in SPNR and 
Zfr. The prediction that NF45 should interact with both 
of these related proteins is borne out by the 
co-precipitation studies and shows that the interaction 
between Zfr and NF45 is present in mammals as well as 
in flies. Furthermore, this interaction is disrupted when 
these proteins are co-transfected with NF45 that is 
mutated at critical residues on helix 9, indicating that 
NF90, SPNR and Zfr use a conserved interface for their 
interaction with NF45. 
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Figure 5. Co-immunoprecipitation studies of NF45 with other DZF domain-containing proteins. (A) Overview of domain organization of 
DZF-domain containing proteins. (B) Co-immunoprecipitation studies carried out with HA-tagged proteins (1. HA-NF45, 2. HA-NF90, 3. 
HA-STRBP) and with differentially tagged pairs of NF45 (lane 4) and NF90 (lane 5). (C) Co-immunoprecipitation studies of NF45 wild-type 
(lanes 1, 3, 4 and 6) or NF45 mutant (lanes 2, 5 and 7) co-transfected with NF90 (lanes 1 and 2), SPNR (lanes 3,4 and 5) and Zfr (lanes 6 and 7). 
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These interactions help to explain the evolutionary re- 
lationship between NF90 and NF45. The DZF domains of 
NF90 and NF45 share ~30% sequence identity and a high 
degree of structural similarity. The complex that the two 
proteins form is very intimate and, based on our thermal 
denaturation assays and previous in vivo observations 
(46), NF45 is unstable in the absence of a binding 
partner both in vitro and in vivo. Both NF45 and Zfr are 
found in worms and insects and a Zfr-like protein that has 
only one annotated zinc finger is found, along with NF45, 
in early metazoans such as Trichoplax adherans and Ciona 
intestinalis. NF90 and SPNR, on the other hand, are only 
found in vertebrates. We would predict that NF45 is 
always found in a complex with NF90, SPNR or Zfr/ 
Zfr-like proteins. It is also possible that some early 
metazoa might have a symmetrical NF45 homodimer. 
Indeed, the symmetrical interactions at the core of the 
dimerization interface suggest that all of these complexes 
might have arisen from a homodimeric precursor. 

It is interesting that NF45, which retains a conserved 
ligand-binding site, is also the component that is able to 
participate in other protein complexes. That NF45 can 
participate in a variety of different complexes also has 
important implications for functional studies of all of 
the DZF proteins. Both NF90 and Zfr are widely ex- 
pressed in a variety of tissues and so NF90/NF45 and 
Zfr/NF45 complexes could co-exist in the same cells. 
SPNR shows a more restricted tissue distribution and is 
found primarily in brain and testes. However both NF90 
and Zfr are also present in neuronal cells, suggesting that 
in this system there is an even richer variety of this family 
of complexes present. Since levels of NF90 and NF45 are 
known to be co-regulated, caution should be exercised 
when overexpressing or knocking down individual 
proteins as this could potentially lead to a redistribution 
of other DZF domain complexes. 

Further cell-based, biochemical and structural studies 
are required to determine the functional role of the DZF 
dimerization domain in its various different guises. The 
dimer presents an extended protein surface that is highly 
conserved (Supplementary Figure S5). This extended 
surface could potentially provide a docking platform for 
protein-protein interactions. Furthermore, although the 
DZF domain is not required for dsRNA binding, it 
might nevertheless contribute to RNA recognition, either 
through the conserved ligand-binding site or through 
sequence-specific interactions. Future studies will eluci- 
date the role of this platform in the various biological 
roles proposed for NF90 and NF45. 
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